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Abstract 

We consider the recent formulation of the Algorithmic Lovasz Local Lemma pin for finding 
objects that avoid “bad features”, or “flaws”. It extends the Moser-Tardos resampling algo¬ 
rithm m to more general discrete spaces. At each step the method picks a flaw present in the 
current state and “resamples” it using a “resampling oracle” provided by the user. However, it 
is less flexible than the Moser-Tardos method since Pi require a specific flaw selection rule, 
whereas m allows an arbitrary rule (and thus can potentially be implemented more efficiently). 

We formulate a new “commutativity” condition, and prove that it is sufficient for an arbitrary 
rule to work. It also enables an efficient parallelization under an additional assumption. We 
then show that existing resampling oracles for perfect matchings and permutations do satisfy 
this condition. 

Finally, we generalize the precondition in P (in the case of symmetric potential causality 
graphs). This unifies special cases that previously were treated separately. 


1 Introduction 

Let H be a (large) set of objects and F be a set of flaws, where a flaw f £ F is some non-empty set 
of “bad” objects, i.e. f F Q. Flaw / is said to be present in a ii a £ f . Let ¥„ = {f £ F \ a £ f} 
be the set of flaws present in cr. Object a is called flawless if Fo- = 0. 

The existence of flawless objects can often be shown via a probabilistic method. First, a prob¬ 
ability measnre a; on 11 is introduced, then flaws in F become (bad) events that should be avoided. 
Proving the existence of a flawless object is now equivalent to showing that the probability of avoid¬ 
ing all bad events is positive. This holds if, for example, all events f £ F are independent and the 
probability of each / is smaller than 1. The well-known Lovasz Local Lemma (LLL) [5] is a pow- 
erfnl tool that can handle a (limited) dependency between the events. Roughly speaking, it states 
that if the dependency graph is sparse enough (e.g. has a bounded degree) and the probabilities of 
individual bad events are sufficiently small then a flawless object is guaranteed to exist. 

LLL has been the subject of intensive research, see e.g. [20] for a relatively recent survey. One 
of the milestone results was the eonstructive version of LLL by Moser and Tardos m- It applies to 
the variable model in which = Ai x ... x A„ for some discrete sets A,, event / depends on a small 
snbset of variables denoted as vbl(/) C [n], and two events f,g are declared to be dependent if 
vbl(/) n vbl(g) 0. The algorithm proposed in [T6| is strikingly simple: (i) sample each variable 
cjj for i £ [n] according to its distribution; (ii) while F^- is non-empty, pick an arbitrary flaw f £ F^ 
and resample all variables Oi for i £ vbl(/). Moser and Tardos proved that if the LLL condition 
in [5] is satisfied then the expected number of resampling is small (polynomial for most of the 
known applications). 

The recent development has been extending algorithmic LLL beyond the variable model, and 
in particular to non-Cartesian spaces. The first such work was by Harris and Srinivasan |8|, who 
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considered the space of permutations. Achlioptas and Iliopoulos [T] introduced a more abstract 
framework where the behaviour of the algorithm is specified by a certain multigraph. Harvey 
and Vondrak proposed a framework with regenerating resampling oracles [9], providing a more 
direct connection to LLL. Achlioptas and Iliopoulos [2j extended the framework to more general 
resampling oracles. 

In this paper we study this setting from It does not assume any particular structure on 

sets Q and F. Instead, for each object a £ Q and flaw f £ F^j the user must provide a “resampling 
oracle” specified by a set of actions A{f, u) C O that can be taken to “address” flaw /, and a 
probability distribution p{a'\f,a) over u' G A{f,a). At each step the algorithm selects a certain 
flaw / £ F„, samples an action a' £ A{f, a) according to p{cr'\f, a), and goes there. This framework 
captures the Moser-Tardos algorithm [16], and can also handle other scenarios such as permutations 
and perfect matchings (in which case H cannot be expressed as a Cartesian product). 

One intriguing difference between the methods of |16j and mum is that |16) allows an arbitrary 
rule for selecting a flaw f £ F^^, whereas mum require a specific rule (which depends on a 
permutation it of F chosen in advance)]^ We will say that a resampling algorithm is flexible if it is 
guaranteed to work with any flaw selection rule. We argue that flexibility can lead to a much more 
efficient practical implementation: it is not necessary to examine all flaws in Tq-, the first found 
flaw will suffice. If the list of current flaws is updated dynamically then flexibility could potentially 
eliminate the need for a costly data structure (such as a priority queue) and thus save a factor 
of 0(logn) in the complexity. The rule may also affect the number of resamplings in practice; 
experimentally, the selection process matters, as noted in m- 

Achlioptas and Iliopoulos discuss flaw selection rules in [U Section 4.3], and remark that they 
do not see how to accommodate arbitrary rules in their framework. It is known, however, that 
in special cases flexible rules can be used even beyond the variable model. Namely, through a 
lengthy and a complicated analysis Harris and Srinivasan [8] managed to show the correctness of 
a resampling algorithm for permutations, and did not make assumptions on the flaw selection rule 
in their proof. They also proved a better bound for the parallel version of the algorithm. 

This paper aims to understand which properties of the problem enable flexibility and parallelism. 
Our contributions are as follows. 

• We formulate a new condition that we call “commutativity”, and prove that it is sufficient 
for flexibility. 

• We prove that it gives a better bound on the number of rounds of the parallel version of the 
algorithm. In particular, we show how to use commutativity for handling “partial execution 
logs” instead of “full execution logs” (which is required for analyzing the parallel version). 

• We show that existing resampling oracles for permutations [8] and perfect matchings in 
complete graphs are commutative. (In fact, we treat both cases in a single framework). 
Thus, we provide a simpler proof of the result in [8] and generalize it to other settings, 
in particular to perfect matchings in certain graphs (for which existing algorithms require 
specific rules). 

• We generalize the condition in [2] for the algorithmic LLL to work (in the case of symmet¬ 
ric potential causality graphs). The new condition unifies special cases that were treated 
separately in [2]. 

To our knowledge, our commutativity condition captures all previously known cases when the flaw 
selection rules was allowed to be arbitrary. 

^The papers mm actually allowed more freedom in the choice of permutation tt, e.g. it may depend on the 
iteration nnmber. However, once n has been chosen, the algorithm shonld still examine some “current” set of flaws 
and pick the lowest one with respect to tt. 
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Other related work Applications that involve non-Cartesian spaces n (such as permutations, 
matchings and spanning trees) have often been tackled via the Lopsided LLL [6] ; we refer to |14( fT5] 
for a comprehensive survey. On the level of techniques there is some connection between this paper 
and a recent work by Knuth [lO]; we discuss this in Section]^ 

2 Background and preliminaries 

First, we give a formal description of the algorithm from Pi. Assume that for each object 
IT G 0 and each flaw f € there is a non-empty set of actions A(f, a) F Ll that can be taken 
for “addressing” flaw / at a, and a probability distribution p{a'\f,a) over a' G A{f,a). Note, by 
definition A{f,a) is the support of distribution p{-\f,a). The collection of all resampling oracles 
will be denoted as p. We fix some probability distribution a; on Q with uj{a) > 0 for all u G O 
(it will be used later for formulating various conditions). Note that our notation is quite different 
from that of Harvey and Vondrak P0 The algorithm can now be stated as follows. 

Algorithm 1 Random walk. Input: initial distribution over fl, strategy A. 

1: sample cr G If according to 
2: while F(j non-empty do 
3: select flaw f £ F„ according to A 

4: sample a' G A{f,a) according to distribution p{a'\f,a), set cr £- o'. 

5: end while 


Clearly, if the algorithm terminates then it produces a flawless object 0. The works PPP used 
specific strategies A. As stated in the introduction, our goal is to understand when an arbitrary 
strategy can be used. This means that flaw / in line 3 is selected according to some distribution 
which is a function of the entire past execution histor}0 Note that if flaw / G To- in line 3 depends 
only on 0 then the algorithm can be viewed a random walk in a Markov chain with states fl, while 
in a more general case the walk can be non-Markovian. 

2.1 Walks and the potential causality graph 

We say that a A u' is a (valid) walk if it is possible to get from state 0 to o' by “addressing” flaw 
/ as described in the algorithm, i.e. if two conditions hold: f £ F^ and o' £ A(f,o). Whenever we 

write 0^0', we mean that it is a valid walk. 

In many applications resampling oracles satisfy a special condition called atomicity [T]. 

Definition 1. p is called atomic if for any f £ F and o' £ LI there exists at most one object o £ LI 
such that o A- o'. 

Next, we need to describe “dependences” between flaws in F. Let ~ be some symmetric relation 
on F (so that (T, ~) is an undirected graph). It is assumed to be fixed throughout the paper. For 
a flaw / G T let r(/) = {g £ F \ f g} be the set of neighbors of /. Note, we may or may not 
have f ^ f, and so T(f) may or may not contain f. We will denote r+( f) = r(f) U { and also 
r(5) = UfesW) and T+{S) = for a subset S C F. 

^ “Flaws” / correspond to “bad events” Fi in [9]. The distribution over 12 was denoted in [9] as /r, the states of 
f2 as w, and the resampling oracle for the bad event Ei at state iv € 12 as ri(w). 

^The description of the algorithm in [T^ says “pick an arbitrary violated event’. This is consistent with our 
definition of an “arbitrary strategy”: in the analysis Moser and Tardos mention that this selection must come from 
some fixed procedure (either deterministic or randomized), so that expected values are well-defined. 
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Definition 2. Undirected graph (F, ~) is called a potential causality graph for p if for any walk 
a A- a' there holds F^i C — {/}) U r(/). 

In other words, r(/) must contain all flaws that can appear after addressing flaw / at some 
state. Also, r(/) must contain / if addressing / at some state can fail to eradicate /. 

Note that in Definition]^ we deviated slightly from m- in their analysis the potential causality 
graph was directed and therefore in certain cases could capture more information about D. While 
directed graphs do matter in some applications (see examples in mm), we believe that in a typical 
application the potential causality relation is symmetric. Using an undirected graph will be essential 
for incorporating commutativity. 

A subset S' C F will be called independent if for any distinct f,g € S we have f g- (Thus, 
loops / ~ / in the graph (F, ~) do not affect the definition of independence). For a subset S C F 
we denote Ind(S) = {T C S | T is independent}. 


2.2 Commutativity 

We now formulate new conditions that will allow an arbitrary flaw selection rule to be used. 


Definition 3. (/?, ~) is called weakly commutative if there exists a mapping SWAP that sends any 
walk ai A a 2 A with f oo g to another valid walk ai A a 2 A as, and this mapping is injective. 

Note that in the atomic case the definition can be simplified. Namely, (/?, ~) is weakly commu¬ 
tative if and only if it satisfies the following condition: 

• For any walk ai A a 2 A as with f g there exists state G D such that ai A a!^ A as is 
also a walk. 

Indeed, by atomicity the state ci^ is unique, and so mapping SWAP in Definition is constructed in 
a natural way. This mapping is reversible and thus injective. 

For several results we will also need a stronger property. 


Definition 4. (p, ~) is called strongly commutative (or just commutative ) if for any walk r = 
ai A a 2 A as with f oo g and SWAP(t) = ai A a 2 A as there holds 


pA2\f,o-l)picrs\g,cr2) = picr2\9,cri)pA3\f,cr2) 


( 1 ) 


It is straightforward to check that strong commutativity holds in the variable model of Moser 
and Tardos. In fact, an additional property holds: for any ai A a2 A as with f oo g there 

exists exactly one state G D such that ai A a 2 A as. Checking strong commutativity for 
non-Cartesian spaces D is more involved; we refer to Section]^ for details. 


2.3 Parallel version 

We will also consider the following version of the algorithm (see Algorithm [^ . It is equivalent to 
the parallel algorithm of Moser and Tardos |lfi) in the case of the variable model, and to the parallel 
algorithm of Harris and Srinivasan [8] in the case of permutations. It is also closely related to the 
“MaximalSetResample” algorithm of Harvey and Vondrak [9] (see below). 

Lines 3-8 will be called a round. In some cases each round admits an efficient parallel imple¬ 
mentation (with a polylogarithmic running time). For example, this holds in the variable model of 
Moser and Tardos |16j . Also, Harris and Srinivasan [8] presented an efficient implementation for 
permutations. Accordingly, we will be interested in the number of rounds of the algorithm. 
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Algorithm 2 Parallel random walk. 

1: sampe a G 0 according to distribution 
2: while non-empty do 
3: set / = 0 

4: while set F^j — r'^(/) is non-empty do 

5: pick some f € F^ — r’^(/) 

6: sample a' G A{f,a) according to p{a'\f,a), set a <— ex'. 

7: set I <— / U {/} 

8 : end while 

9: end while 


Note, during round r set i^o- — in line 5 shrinks from iteration to iteration (and so flaw / 

in line 5 satisfies / G i^o-r) where cr^ is the state in the beginning of round r). This property can be 
easily verified using induction and Definition 

TT-stable strategy Let us fix a total order on F defined by some permutation vr of F. Consider 
a version of Algorithm where flaw / in line 5 is selected as the lowest flaw in F^ — (with 

respect to This corresponds to Algorithm with a specific strategy A; this strategy will be 
called TT-stable. It coincides with the MaximalSetResample algorithm of Harvey and Vondrak [9j. 

Although we focus on the commutative case, we will also state results for 7r-stable strategies 
since they follow automatically from the proof (which is based on the analysis of TT-stable walks). 


2.4 Algorithmic LLL conditions 

In this section we formulate sufficient conditions under which a flawless object will be guaranteed to 
exist. The conditions involve two vectors, A and p. Roughly speaking, A characterizes resampling 
oracles and p, characterizes graph (F, ~). 


Definition 5. The pair (p, ~) is said to satisfy Algorithmic LLL conditions if there exist vectors 
A, G such that 


A/ > p{cT'\f, 


cr 


uj{a') 


hf 




V/ G F, (t' G D 

yf^F 


(2a) 

(2b) 


seind(r{/)) 


where 9 G (0,1) is some constant and p{S) = Wg^ghg- 

Of course, vector A can be easily eliminated from Q. However, it is convenient to have it 
explicitly since in many cases it has a natural interpretation. Achlioptas and Iliopoulos |2] called A 


flaw charges, though instead of (2a) they used slightly stronger conditions. Namely, they considered 


the following cases; it is straightforward to check that in each one of them vector A satishes (2a): 


The case from their earlier work [T], which in the current terminology can be described as 
follows: cu is a uniform distribution over D, p{-\f,a) is a uniform distribution over A(f,a), 
and p is atomic. They then dehned \ f = 1/ miuo- e/ 

Regenerating resampling oracles of Harvey and Vondrak [9] specified by the equation 

= ^^(< 7 ') yf€F,a'€n 




0 - 6 / 
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where u}{f) = Achlioptas and Iliopoulos [2] defined \f = oj{f). 

In the general case, [2] defined flaw charges via 


\f = bf max 

a-e/,cr'eA(/,cr) 

where bf = max |{(T G f : a' G A{f, it)}|. 

ct'gQ 


pKI/>o-) 


w(fT') 


Remark 1. An alternative condition that appeared in the literature (for certain X’s) is 




A 




scr(/) 


/^/ 


( 3 ) 


96 r(/) 


Clearly, (2b) is weaker than (§. We mention that (§ is analogous to the original LLL condition 
in m, while ( |2b[ ) corresponds to the cluster expansion improvement by Bissacot et al. (with 
the matching algorithmic version by Pedgen m who considered the variable model of Moser and 
Tardos). It is known that the cluster expansion version can give better results for some applications, 

see e.g. 0 [T^ [f. 


Shearer’s condition Shearer [19] gave a sufficient and necessary condition for a general LLL 
to hold for a given dependency graph. Kolipaka and Szegedy im showed that this condition is 
sufficient for the Moser-Tardos algorithm, while Harvey and Vondrak generalized the analysis 
to regenerating resampling oracles. We will show that the same analysis holds for the framework 
considered in this paper. 

Consider vector p G For a subset 5 C F denote p^ = OfesP/i this is a monomial in 

variables {pf \ f G F}. Also, define polynomial qs as follows: 


qs = qsip) = 

/:5C/eind(F) 


( 4 ) 


Definition 6. Vector p is said to satisfy the Shearer’s condition if qs{p) > 0 for all S V F, and 
q0{p) > 0 . 

The pair (p, ~) is said to satisfy Shearer’s condition if there exist vector p satisfying Shearer’s 
condition, vector A satisfying (2a), and a constant 9 G (0,1) such that Xf < 9 ■ pf for all f G F. 


3 Our results 

First, we state our results for the sequential version (Algorithm [^. Unless mentioned otherwise, 
the flaw selection strategy and the initial distribution are assumed to be arbitrary. 

Theorem 7. Suppose that (p, ~) satisfies either condition © or the Shearer’s condition, and one 
of the following holds: 

(a) Algorithm^ uses a ir-stable strategy. 

(b) (p, ~) is weakly commutative and atomic. 

(c) (p, ~) is strongly commutative. 

Define 

U, 7 esnpp(<.,;i-t) Ind(F^) in cases (a,b) 

Ind(F) in the case (c) 


init 


CO 


init 
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= max — 
o-en w((t) 


Ind“"* = 
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where supp(a;^'^^^) = {a G > 0} is the support o/a;^°^^. The probability that Algorithm^ 

produces a flawless object in fewer than T + r steps is at least 1 — 6 '^ where 


T 


log0 


-1 


log 7 ""“+log ^ fi{R) 


( 5 ) 


and /u(i?) = O/ei?/^/ condition or p.{R) = (in the case of the Shearer’s 

condition). 

Note that part (a) of Theorem is a minor variation of existing results [H [2] (except that our 
precondition (2a) unifies conditions in previous works - see Section 2.4): 

• Harvey and Vondrak [9] proved Theorem j^a) in the case of regenerating oracles and distri¬ 
butions = uj, with a slightly different expression for T. 

• Achlioptas and Iliopoulos [2] proved the result for the “RecursiveWalk” strategy in the 
special cases described in Section 2.4 (and assuming condition (2b)). 

Parts (b,c) are new results. 


Remark 2 . The possibility of using distribution which is different from ui was first proposed 

by Achlioptas and Iliopoulos in W- Namely, they used a distribution with |supp(u;“^*)| = 1 , and 
later extended it to arbitrary distributions in If. There is a trade-off in choosing 

smaller supp(a;^“^^) leads to a smaller set but increases the constant 7^“^^^. It is argued 

in If that using / uj can be beneficial when sampling from uj is a difficult problem, or when 

the number of flaws is exponentially large. 


Next, we analyze the parallel version. 

Theorem 8. Suppose that (p, ~) satisfies either condition ([^ or the Shearer’s condition, and is 
strongly commutative. Then the probability that Algorithm produces a flawless object in fewer 
than T -\- r rounds is at least 1 — 6 ^ where 


T 


1 

log0~^ 


logy^""^ + log^p/ 
f&F 


( 6 ) 


where 7“^* 
tion). 


is the constant from Theorem 


?, and Hf 


g{/}(p) 

90 (p) 


(in the case of the Shearer’s condi- 


Our techniques The general idea of the proofs is to construct a “swapping mapping” that 
transforms “walks” (which are possible executions of the algorithm) to some canonical form by 
applying swap operations from Dehnitionj^ Importantly, we need to make sure that the mapping 
is injective: this will guarantee that the sum over original walks is smaller or equal than the sum over 
“canonical walks”. We then upper-bound the latter sum using some standard techniques [mi- 
We use two approaches: 

1. Theorem j^b): transforming walks to “forward stable sequences” (a forward-looking analy¬ 
sis). This works only in the atomic case (under the weak commutativity assumption), and 
can make use of the knowledge of the set supp(a;^“^^), leading to a tighter dehnition of the 
set Ind^i^. 
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2. Theorems l^c) and transforming walks to “backward stable sequences” (a backward¬ 
looking analysis). This works in the non-atomic cases, but requires strong commutativity. 
In this approach the “roots” of stable sequences are on the right, and have no connection to 
^]^-g jy^gg^] 2 s that we must use = Ind(T). 

Analyzing the parallel version requires dealing with “partial execution logs” instead of “full 
execution logs”. It appears that this is possible only with backward sequences. 

Note that previously a backward-looking analysis (with either “stable sequences” or “witness 
trees”) was used for the variable model of Moser and Tardos nsumiisi, while a forward-looking 
analysis was used for LLL versions on non-Cartesian spaces [IlEllS] and also on Cartesian spaces [7]. 

After the first version of this work |12j we learned about a recent book draft by Knuth m- 
He considers the variable model of Moser-Tardos, and gives an alternative proof of algorithm’s 
correctness which is also based on swapping arguments (justified by a technique of “coupling” two 
random sources, similar to [l6j). We emphasize that we go beyond the variable model, in which 
case justifying “swapping” seems to require different techniques. 

The proofs of Theorems and are given below in Sections and (A brief overview of these 
proofs can be found in m)- The most technical part is probably constructing an injective swapping 
mapping for transforming to backward stable sequences (proved in Section 5.6). In Section]^ we 
describe our third result, which is a proof of strong commutativity of some existing resampling 
oracles. We also consider one application, namely rainbow matchings in complete graphs. 


4 Proof of Theorem [^a) 

First, we will describe those results that do not involve commutativity. In particular, we will 
prove Theorem l^a). Techniques used in this section are a combination of ideas from previous 
works UDEEIE]. 

We write f — g for flaws f,g € F if either f ^ g or f = g (and f ^ g otherwise). 

"I/Il 'XU't 

A walk of length t is a sequence r = cii <T 2 ... uj at+i such that Wi G and cTj+i G 
A(wi,ai) for i G [t]. Its length is denoted as |r| = t. For such a walk we define quantity 

p{t) =a;"“^^(cTi) • p{ai+i\wi,ai) (7) 

ie[t] 

Let A be the strategy for selecting flaws used in Algorithm We assume in the analysis that 
this strategy is deterministic, i.e. the flaw Wi in a walk r = ui ^ ... (Ti ^ ... is uniquely 
determined by the previous history n = ... at. This assumption can be made w.l.o.g.: 

if A is randomized (i.e. a distribution over some set of deterministic strategies) then the claim of 
Theorem]^ can be obtained by taking the appropriate expectation over strategies (whose number 
is finite for a fixed finite t). A similar argument applies to Theorem]^ 

A walk r of length t that can be produced by Algorithm with a positive probability will be 
called a bad t-trajectory. Equivalently, it is a walk that starts at a state a G supp(a;’’'“’^^) and follows 
strategy A. Note that it goes only through flawed states (except possibly the last state). Let Bad(f) 
be the set of all bad t-trajectories. Clearly, for any r G Bad(t) the probability that the algorithm 
will produce r equals p(r), as defined in Q. This gives 

Proposition 9. The probability that Algorithm^ takes t steps or more equals X]TeBad(t) ■ 

If VF = wi... wt is the complete sequence of flaws in a walk r then we will write t = W. If 
we want to indicate certain intermediate states of r then we will write them in square brackets at 
appropriate positions, e.g. t = [ai]wiW 2 [(T^]w 4 W^[cFQ]. 



In general, a sequence of flaws will be called a word, and a sequence of flaws together with some 
intermediate states (such as [(Ti]wiW 2 [u^]wiW^[aQ\) will be called a pattern. For a pattern X we 
define {X) = {r | r = X} to be the set of walks consistent with X. The length of X (i.e. the number 
of flaws in it) is denoted as \X\. 

Lemma 10. For any word W and state a we have 


T^{W[cr\) 


V{r) 

T&{W) 


< 7 ^“^^ • X^r ■ io{a) 


( 8 a) 

( 8 b) 


where for a word W = wi ... wt we denoted Xw = OieM described in the previous para¬ 

graph, (VF[cr]) is the set of walks r whose sequence of flaws is W and the last state is a.) 


Proof. Summing ( 8 a) over a £ Q gives ( 8 b), so it suffices to prove the former inequality. We use 
induction on the length of W. If W is empty then (kF[(T]) contains a single walk r with the state 
cr; we then have p(r) = ti;^°^^((T), and the claim follows from the definition of 7 ^“^^^ in Theorem]^ 
This establishes the base case. Now consider a word W = Wf with f £ F and a state a'. We can 
write 


Y = Y Y pi'^) ■ 

o-:a'£A{f,a) T£{W[cr]) 


(a) _—_ . . (b) . . . . 

< Y 7^“^* • Aw • w(cj) • p(cr'|/,o-) < 7^”^* • Aw • [A/•a;((T')] = • Arv'• u;((t') 

o-:a'£A{f,o-) 


where (a) is by the induction hypothesis, and (b) follows from (2a). This gives the induction step, 
and thus concludes the proof of the lemma. □ 


The following technical result will also be useful. 

Lemma 11. Consider a walk r= ... [cr]ui ... rt*,... where k > 1. Suppose at least one of the 
following holds: 

(a) Uk is not present in a. 

(b) ui = Uk. 

Then there exists index i £\k — 1] such that Ui ^ u^. 

Proof. We will assume that t = ... ai ^ a 2 ^ ... ^ (Xk+i... where a\ = o. 

(a) Flaw Uk is present in cjfc (since ak ^ (Xk+i is a walk) but not in ai. Thus, there must exist 
index i £ [k — 1] such that Uk is present in Ui+i but not in ai. We know that at ^ cTj+i is a valid 
walk. Thus, addressing Ui have caused Uk to appear, and therefore Ui ^ Uk- 

(b) Assume that Uk is present in ui (otherwise the claim holds by (a)). If Uk is present in <72 

then Uk ~ Uk (since addressing ui = Uk at state ai did not eliminate flaw Uk). Otherwise, if Uk is 
not present in cr 2 , we can apply part (a) and conclude that Ui ~ Uk for some i G [2, A: — 1]. □ 


4.1 Stable walks and stable sequences 

As shown in [9], if A is a 7 r-stable strategy then walks r that it produces have a special structure: 
the word W corresponding to r can be uniquely described by a stable sequence. This section gives 
all necessary definitions. 
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Definition 12. A sequence of sets (p = {Ii,..., Ig) with s > 1 is called stable if Ir G Ind(F) for 
each r G [s] and Ir+i C r'''(/r) for each r G [s — 1]. 


Definition 13. A word W = wi.. .wt is called stable if it can he partitioned into non-empty words 
as W = Wi... Wg such that flaws in each word Wr are distinct, and the sequence (Ii,... ,Ig) is 
stable where Ir is the set of flaws in Wr (for r G [s] If in addition each word Wr = Wi .. .Wj 
satisfies Wi Wj then W is called vr-stable. 

A walk T = W is called stable (ir-stable) if the word W is stable (-n-stable). 


It can be seen that for a stable word the partitioning in Definition is nniqne, and can be 
obtained by the following algorithm. Start with one segment containing wi, and then for i = 2,... ,t 
do the following: if there exists flaw Wk in the last segment with Wk — Wi then start a new segment 
containing Wi, otherwise add Wi to the last segment. (If this algorithm is applied to an arbitrary 
word W then it may fail to produce a stable sequence since in the latter case, when Wi is added to 
the last segment Ir, Wi may not belong to 

Let Stabir be the set of vr-stable words W that satisfy the following condition: 

• There exists walk r such that either t = W or t = REV [ID], where REV [ID] = wt ■ ■ .wi is the 
reverse of word ID = wi ... wt. 

For a stable word ID let Rw be the first set (the “root”) of the stable sequence p = (Ii,... ,/s) 
corresponding to ID, i.e. Rw = h- (If ID is empty then Rw = 0 ). Denote Stab^(/?) = {ID G 
Stab^r : Rw = R}, Stab^(t) = {ID G Stab.^- : |ID| > t} and Stab 7 r(/?, t) = Stab^(/i) n Stab^(t). 
The following result is proven in Section using techniques from 


Theorem 14. Suppose that {p, ~) satisfies either the cluster expansion 
Shearer’s condition from Definition(^ Then 


condition (2b) or the 


Xw < KR) • V/? G Ind(i^) 

WSStab7r(R,t) 


(9) 


4.2 Proof of Theorema): a wrap-up 

It is not difficult to show that a vr-stable strategy produces vr-stable walks (see the statement below). 
The reverse, however, is not necessary true: it may e.g. happen that some flaw / does not appear 
in a vr-stable walk r, but is present in all states of r and would have been selected by any vr-stable 
strategy. 

Proposition 15. Suppose that strategy A is implemented as in Algorithm^ (with a deterministic 
choice in line 5). Then any r G Bad(t) is a stable walk. If in addition A is a ir-stable strategy (i.e. 
flaw f in line 5 is chosen as the lowest flaw in Fu — r'’'(/) with respect to then any r G Bad(t) 
is a Testable walk. 

Proof. Let s be the number of rounds of Algorithm that produced walk r, and R C [s] be the set 
of flaws addressed in round r G [s], or equivalently the set I at the end of round r (with a possible 
exception for r = s: R may correspond to the “intermediate” set /, depending on where walk r 
was “cut”). By this definition, we have r = IDi... ID, where ID,, is a word containing the flaws in 
R in some order (with \ Wr\ = \R\). We will prove that {R, ... , 1 ^) is a stable sequence; this will 
imply the first claim of the proposition. 

The independence of each set R follows directly from the construction. Consider r G [2, s], and 
let T = ... Wr-i[o']Wr .... We need to show that for each flaw / present in ID^ (i.e. / G Ir) we have 
/ G r''’(/,._i). Suppose it is not the case. Lemma[ll|[a) gives that / is present in a. Therefore, set 
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Ffj — is non-empty (it contains a). But then round r — 1 would not have terminated at 

the state a - a contradiction. 

It remains to consider the case when flaw / in line 5 is chosen as the lowest flaw in Fu — 
with respect to Consider round r, and let Wr = Wi...Wj. Definition and an induction 

argument show that during this round set F^j — r"^(/) in line 5 shrinks from iteration to iteration. 
This implies that Wi Wj, as desired. □ 

We also need the following observation. 

Lemma 16. If t = W is a t:- stable walk starting at state ai then Ry/ G Ind(Tlj^). 


Proof. By definition of a stable word, set Rw is independent, and corresponds to some prefix 
wi .. .Wk of the word W. It remains to show that for each i G [k] we have Wi G Fu^. This follows 
from Lemma [TT|(a) and the condition that wj oo Wi for all j G — 1]. □ 


We now have all ingredients to prove part (a) of Theorem Consider Algorithm with a 
TT-stable strategy. Then each walk t = W from Bad(t) is vr-stable, with W G Stab,r(i)- the 

we also know that Rw £ = jj^ 


definition of Bad(f) and Lemma 
Therefore, 
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lnd{Frj 


Pr [ifsteps > t] = E Pir) < Y. Y. 

reBad(t) WeStab^(i?,t) Te{W) 

< E E • E 

WeStahT,(R,t) Reindi"it 


where (a) holds by Lemma 10 (b) holds by Theorem 14 and T is given by the expression in ([^. 


5 Commutativity: Proof of Theorems [^b,c) and 

f 

From now on we assume that {p, ~) is weakly commutative. Therefore, for any walk r = ... fii —?■ (T 2 ■ 
with f ^ g there exists another walk t' = ... cJi A cr^ ^ era ... obtained from r by applying the 

SWAP operator to the subwalk ui A cr 2 A cjs. Such operation will be called a valid swap applied to 
T. A mapping <I> on a set of walks that works by applying some sequence of valid swaps will be called 
a swapping mapping. Note that if t' = 4>(r) then the first and the last states of t' coincide with 
that of T, and \w' = Aw where t = W, t' = W'. Furthermore, if (p, ~) is strongly commutative 
then p{t') = p{t). 

We now deal with the case when A is an arbitrary deterministic strategy, and so walks r G Bad(t) 
are not necessarily vr-stable. Our approach will be to construct a bijeetive swapping mapping 
that sends walks r G Bad(t) to some canonical walks, namely either to vr-stable walks (which will 
work only in the atomic case) or to the reverse of such walks (which will work in the general case). 

5.1 Proof of Theorem [^b) 

In this section we assume that {p, ~) is atomic. This gives the following observation. 

Proposition 17 ([I]). Walk r = cji A £72 ... cJt A at+i can be uniquely reeonstructed from the 
sequence of flaws wi .. .wt and the final state at+i. 


£73 ... 
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Proof. By atomicity, state Uj can be uniquely reconstructed from the flaw Wi and the state CTj+i. 
Applying this argument for i = t,t — 1,... ,1 gives the claim. □ 


The proposition allows us to write walks more compactly as t = wi... wtlcrt+i]. Also, Lemma 10 
gives for a walk r = TT[(Ti+i] that 


p{t) < 7^°^^ • •a;(crt+i) 


( 10 ) 


In Section 5.5 we will prove the following result. 


Theorem 18 . Suppose that (p, ~) is atomic and weakly commutative. There exists a set of n-stable 
walks Bad,r(i) a swapping mapping : Bad(t) —> Bad 7 r(t) which is a bijection. 

We can now prove Theorem [^b): 


Pr[#steps > t] 


(b) 


p{t) < Y 

rGBad(t) T=VF[(T]GBad(t) 


T=w[o-]eBad,r7) ijeindi”“ westah.^{R,t) o-en 

< 7“'" • E E ^ 7 '“"" • E 


where in (a) we used eq. (10), (b) follows from Theorem 18, and the rest is similar to the derivation 
in Section 1121 


5.2 Reverse stable sequences 

To prove Theorem [^c) and Theorem we will use reverse stable sequences instead of forward 
stable sequences. 

Walk r will be called a prefix of a walk t' if t' starts with r. Walk r is a proper prefix of t' if 
in addition r' 7 ^ r. A word W is called a prefix of r if r = WU for some word U. 

A set of walks X will be called valid if (i) all walks in X follow the same deterministic strategy 
(not necessarily the one used in Algorithm , and (ii) for any t,t' ^ X the walk r is not a proper 
prefix of t' . 

For a walk r containing flaw / we define word VF/ as the longest prefix of r that ends with 
/. Thus, we have t = WtU where W/ = ... f and word U does not contain /. We will also allow 
/ = 0; in this case we say that any walk r contains such /, and define word Wf so that r = Wf. 
Recall that for a word W = wi ... wt its reverse is denoted as REV[IF] = wt.. .wi. The following 
result is proven in Section |5.6[ 

Theorem 19. Fix f G Tu{ 0 }, and let X^ be a valid set of walks containing f. If (p, ~) is weakly 
commutative then there exists a set of walks xl and a swapping mapping : X^ —)• xl which is 
a bijection such that 

(a) for any r G t/ the word W = REV[IT/] is -K-stable (W G Stab^j, and Rw = {/} if f ^ F; 

(b) for any word W the set {r G t/ | REV[IF/] =W} is valid. 
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5.3 Proof of Theorem [^c) 


In this case we have = Ind(T). We will use Theorem 19 with / = 0 and = Bad(t). 

Part (a) gives that for any t = W' from the word W = REV[IT'] satisfies W G Stab7r(i?, t) for 
some R G Ind(T’). We can write 


Pr[#steps >t]= ^ p{t) = ^ p(r) < ^ 

reBad(t) 

where in (a) we used bijectiveness of mapping and strong commutativity of (p, ~), and the rest 
is similar to the derivation in Section [4.21 


5.4 Analysis of the parallel version: Proof of Theorem 

We now analyze Algorithm with a deterministic choice of flaw / in line 5. Equivalently, this can 
be viewed as running Algorithm with some deterministic strategy. 

Theorem 20. Consider a word W and a valid set of walks X such that W is a prefix of every walk 
in X. Then 

^ p(r) < 7^“"^ • Aw (11) 

Proof. We use induction on X)TeA'(l'^l“l^l)- Z)TeA'(l'^l“l^l) = 0 is straightforward: 

we then have X C (W), and so the claim follows from Lemma [To| Consider a valid set X with 

— |1T|) > 1. Let T be a longest walk in X, then |f| > |1T| + 1. Let f be the proper 

prefix of f of length | f | — 1. We have t~ ^ X since A is a valid set. Define set y as follows: 
T = {t G A I f“ is a proper prefix of r}. By the choice of f we get |r| = |f| for all r G T, and 

so we must have t = f~ ^ a for some w ^ F and cj G P. Since all walks in A follow the same 

deterministic strategy, the flaw w in the expression r = r“ —)• a must be the same for all r G T- 
Thus, y = {f~ CT I cj G A} for some set of flaws Y <Z F. In fact, we must have Y C A{w,d) 
where a is the final state of f~. 

Define X~ = (A — T) U {f“}. We have 


Pir) = '^piT)- p{t )=p{f ) 
T&x tgx- T&y 


p{cr\w,a) - 1 

.a£Y 


< 0 


It is straightforward to check that set A is valid, and IT is a prefix of every walk in A . Using 
the induction hypothesis for X~ and the inequality above gives the claim for A. □ 


Consider executions of Algorithm consisting of at least s rounds. For each such execution let 
r be the walk containing flaws addressed in the first s — 1 rounds and the first flaw addressed in 
round s. Let BadPar(s) be the set of such walks r. 

We say that a word U = ui... is a chain of a walk t = wi.. .wt Y U \s a, subsequence of the 
sequence wi.. .wt and Ui = Uj+i for i G [s — 1]. 


Proposition 21. (a) For each r G BadPar(s) the length of a longest chain in r equals s. (b) Set 
BadPar(s) is valid. 


Proof. Part (a) follows directly from Proposition 15 and the definition of a stable walk. Let us 
prove (b). By construction, all walks in BadPar(s) follow the same deterministic strategy used in 
Algorithm]^ Now consider a walk t' G BadPar(s). By the definition of t' G BadPar(s), any proper 
prefix r of t' corresponds to an execution of Algorithm with at most r — 1 rounds, and so the 
length of a longest chain in r is at most r — 1. Thus, r ^ BadPar(s). □ 
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For a flaw / G F let BadPar-f^ (s 
s that ends with /. We now apply Theorem 


be the set of walks r G BadPar(s) that contain a chain of length 
with the set = BadPar'^(s). Since every walk 
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r G BadPar-^(s) has a chain of length s that ends with /, so does every walk r G fh/ (applying a 
swapping mapping to r does not affect the chain). This means that \W/\ > s for each r G df/. 


For a word 


By Theorem 
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/F G Stabjrd/}, s) let (W)-^ be the set of walks r G df/ that satisfy REV[1F/] = W. 
and the argument above we have dl/ = Upvestab ((/■> s) {W)'^. We can thus write 


E 

rGBadPar-^ (s) 


p(r) 


E 


p{t) = 


E 

VPeStab^({/},s) . 


E 

■e(vy)-' 


(b) 

Pi-r) < 


E T 

VPeStab^({/},s) 


init 


(c) 


Ary < 



Pr[grounds > s] < ^ p{t) < ^ ^ p(r) < ^ 7 ^“^^ •/ij • 0 '^ = 0^ 

TSBadPar(s) /SF rSBadPar.^(s) f&F 


where T is given by the expression in ([^. 


5.5 Proof of Theorem 18 (swapping mapping for the atomic case) 


We can extend a “valid swap” operation to words in a natural way: it is a transformation of the 
form ... fg... 1 —?■ ... gf... where f ^ g. We will write W = W if W can be obtained from W 
via a sequence of valid swaps. Clearly, “=” is an equivalence relation. It can be seen that a walk 
r = W[(t] can be transformed to a walk t' = VF'[cr] if and only if VF = W. In this case we will 
write t' = r; again, “=” is an equivalence relation on the set of walks. 


Lemma 22. For any word IF there exists a sequence of valid swaps that transforms W to a TT-stahle 
word. 


Proof. It suffices to show that IF can be transformed to a stable word via valid swaps. (Trans¬ 
forming a stable word IF to a vr-stable word is straightforward: if IF = IFi... IF* is the partition 
described in Definition [T^ then we simply need to apply swaps inside each word Wr to “sort” it 
according to Pjr; any such swap will be valid by the property of IF,.). 

Let IF = wi.. .wt. We will prove by induction on i = 1,...,t that IF can be transformed via 
valid swaps to a word IF' = wi ... Wi... wt such that the prefix wi... Wi is a stable word. The base 
case i = 1 is trivial. Suppose the claim holds for i — 1, and let us show it for i G [2, f\. 

By the induction hypothesis, VF can be transformed to a word VF' = wi... Wi-if .. .wt such that 
wi ... Wi-i is a stable word. Let wi... Wi-i = VFi... VF* be the corresponding partition described 
in Definition 13 Let VF^ be the rightmost word that contains a flaw g with f = g. (If such word 


doesn’t exist then we set r = 0; thus, r G [0, s]). If r = s then we can leave the word VF' as it is - 
it satisfies the induction hypothesis for i. 

Suppose that r G [0, s — 1]. Then we repeatedly swap / with the left neighbor, stopping when 
/ gets between words VFr and VF^+i (or at the first position, if r = 0). By the definition of r, these 
are valid swaps. The new word now satisfies the induction hypothesis for i (flaw / will be assigned 
to word VFr+i). □ 


The lemma means that there exists a swapping mapping $ that sends any walk t = W[a\ G 
Bad(t) to a vr-stable walk r' = VF'[(t]. Next, we will show that any such mapping is injective on 
Bad(t). 


14 












Lemma 23. Suppose that walks r, r' follow the same deterministie strategy and r = r'. Then 


T = T 


Proof. We use induction on the length of r. The base case is trivial: if |r| = \t'\ = 0 then condition 
T = t' implies that t = t' = [a] for some state a. 

Suppose that r = fW[a]. Condition t = t' means that t' and r start at the same state. 
Therefore, since t' and r follow the same deterministic strategy, the first flaw addressed in t' is 
the same as in r. Thus, r' = fW'\a\. Clearly, walks W[a] and 1 T'[(t] follow the same deterministic 
strategy. We will show below that W[a] = W'[cj]; this will mean that W[(t] = W'[a] by the 
induction hypothesis, thus giving the desired result. 

We need to show that W = W'. We know that fW = fW\ therefore there exists a sequence 
of words ..., with = /IT, = fW such that is obtained from l/W via a 

single valid swap operation. Let be the word obtained from by moving the first occurence 
of flaw / in to the first position. Thus, for each i G [k] we have for some 

word It can be seen that either = I/b) (if the valid swap operation applied to 

involved the first occurence of /) or is obtained from via a valid swap (otherwise). Thus, 

either or is obtained from IT^®^ via a valid swap. It remains to notice that 

= IT and IT^^) = IT'. □ 


We can now prove that <h is injective. Suppose that r, r' G Bad(t) and $(r) = <h(r'). We have 
r = <h(r) = ^>(r') = r', and so r = r'. From the previous lemma we obtain that r = r', which 


concludes the proof of Theorem 18 


5.6 Proof of Theorem 19 (swapping mapping for the non-atomic case) 


Consider a word W = wi ... wt- It will be convenient to alternatively write it as IT = wi ... 
where Wj = {wi,ni) and n* counts from the left which occurrence of the flaw Wi it is: n* = \{j G 
[z] I rcj = rcj}| >1. Note that all elements wi,..., are distinct. Tuple Wj will be called a named 
flaw. The flaw associated with the named flaw f will be denoted without the bold font as /, i.e. 
^=if,nf). 

We will denote the element of T U {0} fixed in the theorem as / (rather than /). 

For a walk r = wi... let us define a directed acyclic graph Gr = (Vt-,Et-) as follows: its 
nodes are W = {wi,..., w^}, and the set E,- contains all edges of the form (f, g) where f = g and 
f occurs in r before g, i.e. r = ...f...g.... 

For a walk r containing / we make the following definitions: 

• If / G E, let Wt be the set of flaws g G V,- from which node f can be reached in Gt, where 
f is the named flaw corresponding to the rightmost occurence of / in r. For f G W,- let dr{i) 
be the length of the longest path from f to f in G,- plus one (so that dr(f) = !)• 

• If / = 0, let Wt = Vt. For f G Wt let dr{i) be the length of the longest path from f to a 
sink of Gt plus one (so that for any sink node f G Vt we have dr{f) = 1). 

Let ITt be the word consisting of the named flaws in Wt listed in the order in which they appear 
in r. 

It is straighforward to check that applying valid swaps to r does not affect graph Gt and set Wt 
( but may affect the order of named flaws in VTt). Our goal will be to apply swaps to r so that 
in the end word ITt becomes a prefix of r, and furthermore this prefix equals the word StabT(T) 
which is defined below. 
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Definition 24. For a walk r containing f define word StabTrlT") os follows. For an integer r > 1 let 
Ir = {f G Wt I (ir(f) = r}, and let Wr be the word consisting of the named flaws in D sorted in the 
decreasing order (with respect to Then Stab 7 r(r) = Ws ■ ■ ■ Wi where s = max{(iT-(f) | f G W,-}. 

It can be seen that we have f g for any distinct f, g G 1^. Therefore, “sorting 1^ in the 

decreasing order” in the definition is a vaiid operation (where we assume that f -<t^ g if / -<tt g)- 
Cieariy, appiying a vaiid swap to r does not affect word Stab^(r) (since graph G,- and set W,- 
are unaffected). It is also not difficult to show the following. 

Lemma 25. For any r G word W = REV[Stab 7 r(r)] is ir-stable. Also, Rw = {/} if f € F. 

Proof. Let R = {f \ i G 1^} be the “unnamed” version of the set R from Definition |24[ Consider 
r G [s — 1]. From the the definition of 1^ and Ir+i we obtain that for every g G Ir+i there exists 
f G Ir with (f, g) G Ej- (implying that g G r+(/)). This gives that R+i Q Also, if / G T 

then we have Ii = {f} and so Ji = {/}. Observing that W = REV[Stab^(r)] = REV[VFi] ... REV[W<j] 
we obtain the desired claims. □ 

Our next goal is to show how to transform walk r G to a walk t' = Stab^(r)C/. We will do 
this by applying swaps to swappable pairs in r. 

Definition 26. Consider a walk r containing f. A pair (f, g) of named flaws is called a swappable 
pair in r if it can be swapped in t (i.e. r = ... fg... and f q) and either 
(i) (f,g) G (V,-W,) X W,, or 

N (f>g) G 7 - X 7 - QjTid thctT ovdcT ifi StcLby^- ^t) is diffcvcTitf St 0 .b 7 j-( t) — 

The position of the rightmost swappable pair in r will be denoted as k{T), where the position of 
(f>g) in T is the number of named flaws that precede g in r. If t does not contain a swappable pair 
then k{T) = 0. Thus, k{T) G [0, |r| — 1]. 

It can be seen that the procedure of repeatedly applying swaps to swappable pairs in r must 
terminate. Indeed, swapping a pair of the form (i) moves a named flaw in to the left, which 
can happen only a finite number of times. Similarly, swapping a pair of the form (ii) decreases the 
number of pairs {f, g} G ('^”) whose relative order in IFV is not consistent with the relative order 
in Stab^(T), which again can happen only a finite number of times. (Note, swaps of the other type 
do not affect these arguments). 

Lemma 27. Consider a walk t = AU containing f where A,U some words, and there are no 
swappable pairs inside U. Then U = BC where sequence B is a subsequence o/Stab 7 r(r), and C 
does not contain named flaws from Wt . 

In particular, if |A| = 0 and t = U does not contain a swappable pair then r = Stab^(r)C'. 

Proof. Let ui,..., u^, be the named flaws from W,- that occur in U (listed in the order of their 
appearance in U). We claim that ui ... Mm is a prefix of U. Indeed, suppose not, then [/=... fu^... 
where f ^ W^- and Uj G W,-. This means that (f, Uj) ^ Et, and so / ^ Ui. But then (f, Uj) is a 
swappable pair in U - a contradiction. 

We obtained a decomposition t = ABC where B = ui... Um- It remains to show that i? is a 
subsequence of Stab 7 r(T). For that it suffices to prove that for any i G [m — 1] the relative order of Uj 
and Uj+i in B is the same as in Stab 7 r(r). Suppose not, i.e. Stab^(r) = ... Uj+i ... Uj.... We must 
have Ui = Uj+i (otherwise (uj, Uj+i) would be a swappable pair in r, contradicting the assumption). 
Therefore, (uj,Uj+i) G E,-, implying that dr{ui) > driuii+i). Inspecting Definition]^ we conclude 
that Uj should be to the left of Uj+i in Stab.n-(r). We obtained a contradiction. □ 
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Results so far can be used to obtain a mapping and set ff/ = that satisfy condition 

To satisfy bijectiveness and condition (b), we need to be careful with the 


(a) of Theorem 19 


order in which we apply swaps to swappable pairs. We will use the following algorithm. First, let 
To = , and then for p = 0,1, 2,... do the following: 

• Let k = xascyir^Xj, ^{t). If A: = 0 then terminate. 

• For each t € Xp do the following: if A:(r) = k then swap the pair (f, g) at position k in r, 
otherwise leave r unchanged. Let T’p+i be the new set of walks. 

Let us fix a word W (over named flaws), and define = {t £ Xp \ Stab 7 r(r) = W} for an 

index p > 0. Theorem 19 will follow from the following result (note that set To[VF] C Af-f is valid 
by the assumption of the theorem). 

Lemma 28. If set T’p[IF] is valid then so is T’p+i[IF], and the mapping from T’p[IF] to T’p+i[VF] 
defined by the algorithm above is injective. 


Proof. First, let us prove injectiveness. Suppose that two distinct walks r°,r° G T’p[IF] are trans¬ 
formed to the same walk r G T’p_|_i[hF]. At least one of one walks r°, r° must have changed. Assume 
w.l.o.g. that r ^ r°. Thus, walk t° = AlgB with f ^ g is transformed to a walk T = AgiB. We 
cannot have r° = r (then walks r°,r would both be in T’p[IF], but they do not follow the same 
same deterministic strategy - a contradiction). Thus, f° = AIgB, and r was obtained from r° by 
swapping f and g. Since the SWAP operation from Definition is injective, we obtain that r° = f°. 

Now suppose that T:’p_|_i[IF] is not valid. Then it contains walks r, r of the form 


T = Wl . . . W^fg . . . 
f = Wl ... w^fg . . . 


such that f 7 ^ f, and the states in r to the left of f match the corresponding states in r to the left 
of f . Here we assume some of the symbols f, g, f, g can equal 0, which means that they don’t exist. 
We also assume that f = 0 =^> g = 0 and f = 0 => g = 0. 

Let T° and f° be respectively the walks in Xp[W] that were transformed to r and r. Since 
AplPF] is valid, at least one of them must have changed. Assume w.l.o.g. that r ^ r°. We know 


that 

(*) r° and f° follow the same deterministic strategy, and they are not proper prefixes of each 
other; 

(**) named flaws wi,..., W£, f are distinct. 

We will also implicitly use the fact that the first \ states of r match those of r, and also that 
swapping adjacent flaws only affects the state between them in a deterministic way, as specihed by 
the mapping SWAP. Four cases are possible: 

• The swapped pair in r was (wj, Wj+i) for i G [^ — 1], and so r° = wi... Wj+iWj... w^fg .... 
Using (*) and (**), we conclude that r / r°, and so r° = wi... Wj+iWj... w^fg .... (Recall 
that swaps are applied at the same position for walks in Xp). Condition (*) now gives f = f. 

• The swapped pair in r was (w£,f), and so r° = wi... fw^g.... Using (*) and (**), we 
conclude that f ^ f°, and so f ° = wi... fw^g.... Condition (*) now gives f = f. 

• The swapped pair in r was (f, g), and so r° = wi... w^gf ... and f G Wt-o . Since there are 
no swappable pairs in r° to the right of (g, f). Lemma 


27 


gives that r° = wi... w^gRC where 
R is a subsequence of Stab 7 r(T°) and C does not contain named flaws from Wt-o. Note that 
B starts with f. 
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By property (*) we get that r° = wi... w^g .... Using Lemma 27 as before, we obtain that 


r° = wi .. .w^gBC where B is a subsequence of Stab 7 r(T°) and C does not contain named 
flaws from W-fo. 

We now observe that T°,f° G d:’p[iy], i.e. Stab 7 r(T°) = Stab^(T°) = W. Therefore, Wt-o = 
Wfo. This fact together with the forms of r°, r° given above implies that B is a permutation 
of B. We also know that both B and B are subsequences of Stab^(r°) = Stab 7 r(r°) = W 
and all elements of W are distinct. Thus, we can conclude that B = B. 

We showed that t° = wi... w^gf. ... Since (g, f) is a swappable pair in r°, it is also a 
swappable pair in r° (at the same position as in r°). Therefore, r = wi... w^fg..., and so 
f = f. 

The swapped pair in r was to the right of (f, g). In this case condition (*) immediately gives 

f = f. 


□ 


6 Constructing atomic commutative resampling oracles 

Let us define a directed multigraph D as follows: its set of nodes is fl, and its set of edges is the 
set of all valid walks a a'. (Each edge of D is labeled by a flaw in F). The atomicity of p means 
any state a' € Q has at most one incoming edge in D labeled by a given flaw f € F. 

j 

Note that we can recover sets A{f, a) from D, since A{f, a) = {a' \ edge a ^ a' belongs to D}. 
Therefore, resampling oracles can be uniquely reconstructed from D, if we assume that p{a'\f,a) 
is a uniform distribution over A{f,a). 

In this section we show that resampling oracles for permutations used in mi and for perfect 
matchings in complete graphs used in [9] are strongly commutative. To prove this, we find it 
easier to take an indirect approach: first, we will describe a generic route for constructing atomic 
multigraphs, then apply it to permutations and matchings and prove weak commutativity. We will 
then see that the resulting resampling procedure coincides with that in urn- 

The constructed multigraph will satisfy the following property: for each flaw / there exists 
constant Af such that \A{f,a)\ = Aj all a ^ f. Thus, p{a'\f,a) = for u' G A{f,a), and so 
weak commutativity will imply strong commutativity. 

We start with some general observations. For an atomic multigraph D let us define a mapping 
-i/j : T X U —)• n U {undefined} that specifies a “backward step” for f € F and a' € Q as 

J 

follows: if there exists the (unique) state cr G such that a ^ a' then ip{f,a') = a, otherwise 
= undefined. Note that V' satisfies the following properties: 

(I) If 7 ^ undefined then C /. 

(II) For any f G F and u G / there exists at least one u' G with iplf, a') = a. 

It is not difficult to see that D can be uniquely reconstructed from U, F and V', since for each 
f G F and a G f we have A{f,a) = {a' G | 'ip{f,a') = a}. Furthermore, any triplet (U,F, V') 
specifies a valid atomic multigraph D, as long as ip satisfies properties (I) and (II) (and F is some 
set of non-empty subsets of Q). Property (II), in particular, is equivalent to the condition that 
A{f, a) is non-empty for each f G F and a G f. 

Thus, the problem of constructing the set of actions A{f, a) for a given /, a can be shifted to 
the problem of constructing a mapping p. Of course, after constructing p one still needs to show 
that sampling from A(f, a) can be done efficiently. 

We remark that Definition of the potential causality graph can also be reformulated in terms 
of the mapping p, as stated below (this claim follows directly from definitions). 
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Proposition 29. Undirected graph (P, ~) is a potential causality graph for D if for any f,g € F 
with f oo g and any a' £ g with tpif, a') 7 ^ undefined we have V’(/, cr') £ g and f ^ g. 

6.1 Matchings 

We now apply the route outlined above to some matching problems. Let G = {V, E) be an 
undirected graph with \V\ = 2n nodes that satisfies the following condition: 

(*) If {u',u,v,v') is a path in G with distinct nodes then {u\v'} £ E. 

We will consider the case when H is the set of perfect matchings in G (so that each object cj G is 
a subset of E). We allow any flaw of the form fM = {cr £ It \ M <£ a} where M is a fixed subset of 
E. It can be assumed w.l.o.g. that M is a matching (otherwise /m would be empty). Thus, F can 
be any subset of {/m \ M £ M.} where M. denotes the set of matchings in G, with Q C Ai. Two 
special cases of this framework have been considered mmii: 

[PI] G is the complete graph on 2n vertices, so that 11 is the set of all perfect matchings of V. 
[P2] Set V can be partitioned into disjoint subsets Ai, Bi,..., Ar, Br such that \Ai\ = \Bi\ for 
i £ [r] and E = {{u, u} | u £ Ai,v £ Bi,i £ [r]}. Thus, G is a union of r complete bipartite 
graphs, and set H corresponds to r permutations. 

In fact, these are essentially the only possibilities allowed by condition (*): it can be shown that 
if G contains at least one perfect matching then each component of G is either a complete graph 
Km or a complete bipartite graph Km,m- We will not need this claim, and so we leave it without a 
proof. Instead, we just assume that one of the two cases [P1,P2] holds (but unlike previous work, 
we will treat them in a unified way, relying mostly on condition (*)). 

We use the following potential causality graph: fM ~ /m' for fM, fu' G F if M U M' is not a 
matching or M = M'. This graph is the same or slightly smaller than graphs used previously for 
cases [PI] and [P2]. (For [P2] the works [8l|9] used a larger relation instead where /m /m' if 
M U M' is not a matching or M Cl M' / 0 ). 

We will construct a mapping if : Ai x H ^ H that satisfies M C a) for any M £ Ai and 
(T G O. It will correspond to the mapping in a. natural way, i.e. f^{fM, cr) = a) for fM £ E. 
Clearly, such ip will be defined everywhere on F x 12, and will satisfy property (I). 

Defining ip Consider M £ Ai and cr G 12. If M contains a single edge e = {u,v}, then we find 
unique u',v' with {n, tt'}, {u,F} G a, and set 

= {a - {{u,u'},{v,v'}}) U {{n, ?;}, {u', u'}} (12) 

Note that if {u,v} £ o' then 'ip{M,o) = 0 . Otherwise nodes u,v,u',v' are distinct, and we have 
ip{M, cj) G 12 by the assumption (*) on the graph G. 

Now suppose that M = {ei,..., e^} contains more than one edge. Then we define 

00 = 0 (Ti = •0({ei},cro) 02 ='ipi{e 2 },oi) ... Ok ='ipi{ek},Ok-i) (13) 

and set 'ip{M,o) = Ok. To show that this is well-dehned, we need to prove that the result does 
not depend on the chosen ordering of M. It suffices to prove this claim for \M\ = 2, then we can 
use an induction argument (since any ordering of M can be transformed to any other ordering via 
a sequence of operations that swap adjacent elements). Proving it for \M\ = 2 can be done by 
inspecting all possible cases, which are visualized in Fig. [^ verification of the claim in each case is 
left to the reader. 
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Figure 1: Possible cases for M = { 61 , 62 } G Af and a G Cl. Top row: solid lines indicate edges in a, 
dashed lines indicate edges in M. Bottom row: result of i/'dcij, ?/;({ 62 }, cr)) = i/)({ 62 }, tr)). 


Proposition 30. (a) In (13) we have (ci,..., 61 } C Uj for any i G[k]. Consequently, M C '0(M, cr) 
for any M G M. and a G H (and so property (I) holds). 

(b) For any a G Cl and M,M' G M with M UM' G M we have ,a)) = tp{M' 

Consequently, the multigraph D defined by {Cl,F,ip) is weakly commutative with respect to 

(c) Relation ^ is a potential causality graph for D. 

Proof, (a) We can use induction on i. The base case i = 0 is vacuous. The induction step for 
i follows from the definition of the mapping (Tj_i) and the fact that (Tj_i cannot contain 

edges that connect an endpoint of 6* to an endpoint of ej for j G [i — 1] (since (ci,..., 6i_i} C ai-i 
by the indnction hypothesis, and (ci, ..., 6 i} is a matching). 

(b) We claim ,'ip{M,a)) = '0(M U AF, a). Indeed, let M = {e\,...,e(} and M' = 

{6£+i, ...,6a:}. Define ao,ai,... ,ak as in ( [T^ , then = ag and = o'fc- Thus, 

it remains to show that ^(M U M',a) = cr^. For that we need to observe that if some edge 6j 
appears in the sequence the (ci, ..., Ck) the second time then we have 6* G Ui-i and consequently 
CTj = 'fi{{ei},ai-i) = (Tj-i. Thus, such 6* can be removed from the sequence without affecting the 
result. After removing duplicates we conclude that U M',a) = ak by the definition of if. 

In a similar way we can show that , cj)) = Tf{M U M', a). This proves the claim. 

(c) Let us show that conditions of Proposition 29 hold. Consider flaws / = /m, g = fM' in F 
with fM fM' and object a G fM'- Condition /m fM' holds since fM fM', so we need to 
show that 'ijj{fM,(^) G fM', or equivalently M' C 


Assume that M = { 6 i,..., 6 fc}, and define sequence ao,ai,... ,ak-i,crk = as in (13). 


Indeed, for the base case the claim M' C a holds since a G fM', and for the induction step we need 
to use the definition of if and the fact that M U M' G M. (which holds since fM fM ') ■ We leave 
verification of the induction step to the reader. □ 


Sampling from A{fM,o'k) The general idea is to “reverse” the process in eq. (13): given flaw 
fM with M = {ei,... , 6 fc} G A4 and object ak G fM, we first generate possible values for ak-i, 
then for ak- 2 , and so on. 

For a subset S C E let ^ = {{u,v), {v, u) \ {u, n} G A} be a “directed copy” of S. For an object 
a G Cl and edges {u,v), {u',v') G define 


S\}a.p^{{u,v),{u',v')) = {a - {{u,v},{u',v'}}) U {{n, n'}, {n, n'}} 
Finally, for an object a G Cl and an edge (w, v) G~^ let us define 

Maiu,v) = {iu',v') G I Swap^((n,n), (n',n')) G Dj 


(14) 


(15) 
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It can be checked that {v,u) G J\fa{u,v) and {u,v) ^ J\faiu,v). Furthermore, in the special cases 
above we have the following: 

[PI] Afa{u,v) = - {{u,v)}. 

[P2] If (u, v) G Ai X Bi then Ma{u, v) = {Bi x Ai) n 

We can now formulate the sampling algorithm (see Algorithm]^. 

Algorithm 3 Sampling from A^Jm, CTk) for M = {ei,..., e^} C G H 
1: for i = k,k — 1,... ,1 do 

2: choose orientation (tt, v) of edge e* = {«, u} 

3: select («', v') G Mai{u, v) — {ei,..., ej_i j uniformly at random 

4: set (Ti_i = Swap^,((u,u), (n',u')) 

5: end for 

6: return uo 


Let us verify the correctness of this algorithm. Using the definitions of ijj and Swap^, the 
following fact can be easily checked. 

Lemma 31. (a) Suppose that {n, v} G ai G (u', v') G v) and ai-i = Swap^. ((u, v), {u', v')). 

Then'tl>{{{u,v}},ai-i) = at. 

(b) Conversely, suppose that '0({{u, u}}, cJi_i) = ai for ei = {u,v} G E and ai-i G fl. Then there 
exists unique {u',v') G Mai{u,v) such that ai-i = Swap^,((u,u), {u',v')). Furthermore, it satisfies 
{u',v'] ^ ai-i - {ei}. 

Using this lemma, we can now show establish correctness of the sampling procedure. We say 
that two executions of Algorithm are distinct if they made different choices in line 3 for some 
i G[k]. 

Proposition 32. Algorithm{^is well-defined, i.e. in line 3 we have Ci G ai. It can generate object 
ao G It if and only if 'fi{M,ao) = ak. Finally, distinct executions produce distinct outputs. 


Proof. The proof will have two parts corresponding to two directions. 

(a) Let ak, ak-i ,... be the sequence of objects produced by the algorithm. We will show using 
induction on i = /c,..., 1, 0 that {ei,..., e*} F ai (and therefore line 3 for index i is well-defined) 
and V:({ei+i,..., ek},ai) = ak. The base case i = A: is trivial. Suppose the claim holds for i G [k], 
let us show it for i — 1. We have {ei,..., e,} C cji by the induction hypothesis; inspecting the rule 
for choosing {u', v'), we conclude that {ei,..., ej_i} C ai-i. For the second claim we can write 


^({e*,.. .,ek},ai-i) = fi{{ei+i,... ,ek},'ip{{ei},ai_i)) = fi{{ei+i,.. .,ek},ai) = ak 
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a) and third is by the 


where the first equality is by the definition of the second is by Lemma 
induction hypothesis. This concludes the argument. 

(b) Suppose that fiiM, ao) = ak for ao G H and M = {ei,..., e^}. Define objects ai,... ,ak 
as in ( |13[ ). We claim that Algorithmcan replicate this sequence (in the reverse order). Indeed, 
by Lemma [ 3 ^ b) it suffices to show that for any {u',v') G J\fafiu,v) with {u',v'} ^ cjj_i — {cj} 
we also have {u',v') G N(jfiu,v) — {ei,..., ei-i\. Suppose not, then {u',v') G {ei,..., ej_i j. By 
Proposition [30)(a) we have {ei,..., Ci-i} C Uj-i, and so {u', v'} G Uj-i. Thus, {u', v'} = e*. But Cj 
does not appear in {ei,..., ei_i}, and so we cannot have [u', v') G {ei,..., ej_i j - a contradiction. 

Let us now prove that the input M, ak and the output uo uniquely determine choices made 
during the execution (this will give the last claim of the lemma). Let dk,. ■ ■ ,ai,ao be the objects 
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produced during the execution, with = (Tk and do = cto- Set i = 1. By Lem ma [M[ a) we have 
'ip{{ei},ai-i) = dj, implying that dj = ai is determined uniquely. By Lemma |31[b) the choice 
of {u',v') in line 3 for index i is also determined uniquely from c 7 j_i,cri and ej. Repeating this 
argument for i = 2,... ,k (i.e. using induction) yields the claim. □ 

We have proved that the output of Algorithmj^is a distribution whose support is A^Jm, crk)- To 
show that this distribution is uniform, we need to observe additionally that the number of choices 
in line 3 for index i depends on i but not on the past execution history (which can be easily checked 
for cases [PI] and [P2]). The cardinality of A(/m, cr^) is the product of these numbers over i G [k], 
and thus depends only on the flaw /m (more precisely, on |M|). 

To summarize, we have constructed an atomic weakly commutative multigraph D, proved that 
Algorithm 1^ samples uniformly from A{fM, (^k), and the size of latter set depends only on Jm (the 
latter implies strong commutativity). It can now be verified that the sampling procedure coincides 
with the procedure in [9] for perfect matchings in a complete graph (in the case [PI]), and with 
the procedure in mm for permutations (in the case [P2]). 


6.2 Application: rainbow matchings in complete graphs 


We refer to lami for applications of resampling oracles for permutations and perfect matchings. 
Here we revisit just one application, namely a rainbow matching problem. Our primary goal is to 
demonstrate how the choice of the distribution affects the bound on the expected runtime, 

and also compare it with the parallel version. 

Let G = (R, E) be a complete graph on 2n vertices such that each edge is assigned a color, and 
each color appears in at most q edges. A perfect matching in G is called rainbow if its edges have 
distinct colors. Achlioptas and Iliopoulos [I] showed that a rainbow matching exists if g < yn for 
some constant 7 < A ~ 0.184. Instead of (2b), they used a stronger condition Q. Harvey and 
Vondrak [9] improved the constant to 7 = 0.21 by exploiting a condition with the cluster expansion 
correction analogous to Q. Below we redo their calculations. 

Let F be the set of flaws /m such M contains two vertex-disjoint edges of the same color, and 
assume that we use the multigraph and relation ~ constructed in the previous section. 


Proposition 33. //y = 0.21 then condition Q can be satisfied by setting Hf = fi = for f G F 
(if n is sufficiently large). 


Proof. Consider flaw /m where M = {{ui, U2}, {^3, U4}} G F. For node v G V lei r(u) C F be 

the set of flaws fM' such that at least one of the edges in M' is incident to v. Clearly, we have 

r(/M) ^ r(ui) U r(u2) U r(u3) U r(u4). Furthermore, if S' G Ind(r(/M)) then for each i G [4] we 
have |5 n r(uj)| < 1 (and consequently \S\ < 4). It can be seen that for each k G [0,4] there are 
at most Uk = (fc)(2n — 1)^(9 — 1 )^ subsets S G Ind(r(/M)) of size k. Indeed, feasible S can be 
generated as follows: first choose a subset of {F(ui), r(u 2 ), r(u3), F(u 4 )} of size k whose elements 
will contain a flaw in S ((^) possibilities), and then for each selected F(uj) 

• choose a node u' matched to Vi (at most 2 n — 1 possibilities), and 

• choose an edge in G of the same color as {vi, u'} (at most q — 1 possibilities). 

Let us set value /i/ for all / G F to the same constant /r > 0, then we get 


4 

^(S) < •/= ( 1 -k (2n-l)(g-l)/i)^ 

5eiiid(r(/M)) k=o 
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By inspecting Algorithmwe can conclude that Af = (2n — 3)(2n — 1) for each f ^ F. Thus, we 
get the following condition: there must exist ^ > 0 such that expression 


6 = 


1 




{2n — 3)(2n — l)/i 
is a constant smaller than 1. Denote /3 = (2n — 3)(2n — 1)/U, then 


1 




(3 


0 < ^ • ( l + 2n-(yn) • • (l + o(l)) ) = 


(l + i7/3 + o(l)r 

/3 


The last expression will be smaller than 1 (for a sufficiently large n) if /? = 3 and 7 = 0.21, where 
we used the constants from [9]. □ 

Let us now estimate the expression in ([^. We have |D| = (2n — 1)!! and log|D| = 0(nlogn). 
When supp(a;^'^^^) = for some G D, we get 7 ^“^* = |D| and thus T = D(nlogn). If, 

on the other hand, = uj then we can write 


ReU^gn lnd[F^) RCF f&F 

Observing that \F\ < {2n)‘^q = 0{n^) and fj, = 0(l/n^), we obtain that T = 0{\F\ log(l + ^)) = 
0{n^ log(l + 0(^))) = 0{n^ ■ = 0{n). Thus, choosing = 00 leads to a better bound than 

initializing the algorithm with some fixed state This may not be surprising, given that in 

the latter situation we need to consider the worst case. Note that a linear bound on the expected 
number of resampling steps has also been shown in [9]. 

We can also compute a bound on the expected number of rounds of the parallel version (Algo¬ 
rithm]^, assuming that = uj. From Theorem]^ we get 

T = 0(logE/eF/^/) = 0(log(|F| ■ n)) = 0(log(n3 • ^)) = O(logn) 

It should be noted, however, that at the moment it is not known whether a round of Algorithm]^ can 
be implemented efficiently (i.e. in an [expected] polylogarithmic time) for matchings in a complete 
graph. Such implementation has only been shown for permutations [ 8 ]. We conjecture that the 
technique in [S] can be extended to matchings in a complete graph, but leave this question outside 
the scope of this work. 


A Counting stable sequences: Proof of Theorem 14 


We say that a sequence ip = (/i,..., A) with s > 1 is strongly stable if 

(i) Ir G Ind(T) for each r G [s], 

(ii) Ir+i F T{Ir) for each r G [s — 1], and 

(iii) Ir ^ 0 for each r G [2, sj. 

(Compared to Definition 12, we added condition (iii), and in (ii) replaced condition 7^+1 C r+(A) 
with a stronger condition Ir+i C T{Ir)). For a stable word W let ipw = (Li,...,/^) be the 
corresponding stable sequence; if W is empty then (pw = (0)- 


Proposition 34. For any W G Stab^r the sequence ipw is strongly stable. 
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Proof. We need to show that the sequence ipw = satisfies additionally property (ii). 

We will prove this under the assumption that there exists a walk r = W. (The case when there 
exists a walk t = REV[W] is completely analogous). 

Let W = Wi... Ws be the partitioning of W given in Definition 13 It suffices to prove that 
if a flaw / is present in adjacent segments Wr and Wr+i then f ^ f. Suppose not: f f- 
Then by Lemma Bb) there exists flaw g between the two occurences of / with f ^ g. We have 
WrWr+i = .( 7 .We have either g G Wr or g G Wr+i, and so we must have f oo g - a 

contradiction. □ 

For a sequence tp = with s > 1 let R^p = Ii be the first set in the sequence, 

and denote \ip\ = ~ OreL] O/e/r Stab be the set of strongly stable 

sequences, Stab(i?) = {ip G Stab : = i?}, Stab(t) = {(p G Stab : \p\ > t} and Stab(i?, t) = 

Stab(i?) n Stab(t). 

Clearly, a 7r-stable word W can be uniquely reconstructed from the corresponding sequence pw 
Thus, W I—)• pw is an injective mapping from Stab 7 r(i?, t) to Stab(i?, t). Also, Aw = 


W G Stab^(i?, t). This means that Theorem 14 will follow from the following result. 


Theorem 35. Suppose that (p, ~) satisfies either the cluster expansion condition (2b) or the 
Shearer’s condition from Definition^ Then 


Y, < p{R) • 6^ 

(f£StSLb{R,t) 


'iR G Ind(F) 


(16) 


Proof. First, assume that the Shearer’s condition holds. In this case the claim has been proven 
in UDi- To elaborate, let p G be the vector from Definition and define 


P<p = 


UUpf 

re[s] f&Ir 


(17) 


Let Stab' be the set of sequences = {R,..., R) that satisfy conditions (i) and (ii) given in the 
beginning of this section. For any integer i we can write 


E 


Pv < 


E 


Pip < 


length of (f is at most i 


qr{p) 

qzip) 


WR G Ind(F) 


(18) 


where the first inequality holds since for any p = (R,..., R) G Stab(i2, t) with s < £ there is a 
corresponding sequence p' = {R, R,... ,R,0,... ,0) G Stab' of length i, and the second inequality 
appears implicitly eq. (2) in [11] and as Lemma 5.10 in [9]. Taking a limit £ 00 in (18) and 

observing that p{R) = we get Ylipesta.h{R t) Pp — From definitions we have Xip < p,p ■ 9^ 

for any p G Stab(i?, t), this gives (16). 


Now assume that condition ([2) holds. We say that a pair of subsets {R, S) is independent if 
i? n 5 = 0 and RU S G Ind(T). For such pair let Sta.'b{R, S,t,£) be the set of sequences of the 
form p = (i?, R,... ,R) with s < £ that satisfy one of the following: 

• |s| = 1 (i.e. p = {R)) and S = 0; 


"^Note that Bissacot et al. |3] proved that the cluster expansion condition implies Shearer’s condition, so it would 
suffice to prove just the second claim. However, the definition of the cluster expansion condition in [3] was slightly 


stronger: the sum in (2bl was taken over S € Indfr"*"!/)), while we sum over S € Ind(r(/)). Due to this annoying 


technicality we consider the two cases separately in the proof. 
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• |s| > 2, S' = /2 — r(Ii) and {I 2 , ...,/«) £ Stab. 

It can be seen that Stab(i2, 0, t, 00 ) = Stab(i2, t) for an independent set R. 

We say that a tuple {R,S,t,£) is valid if {R,S) is an independent pair and t > 0, £ > 1 are 
integers. We will prove the following for any valid tuple {R, S,t,£): 

x^< KR)KS) • 0 ^ (19) 

if£StSLh(R,S,t,£) 


Let us introduce a partial order [I on tuples {R, S, t, £) as the lexicographical order on vectors {£, |i2|) 
(the first component is more significant). We use induction on this partial order. The base case is 
given by a tuple {R,S,t, 1). We can assume that S = 0 and t < |i?| (otherwise Stab(i?, S, t, 1) is 
empty). In this case Stab(i?, S, t, 1) contains a single sequence ip = (i?), and so 


= HXf < ll{pf9) = p{R)-e\^\ < p{R)-0^ 

<peStab{R,0,t,l) f&R feR 


where we used inequality Ay < Hf ■ 0 that follows from condition (2b) with 5 = 0. 

Now consider a valid tuple {R,S,t,i) with £ > 2, and assume that the claim holds for lower 
tuples. Two cases are possible. 

• R = 0. We have a natural isomorphism between sets Stab(0, 5, t, i) and Stab(5, 0,t, £ — 1), 
namely Stab(0, 5, t, £) = {(0, Ii,..., R) \ (Li, ... ,Ir) £ Stab(5, 0,t,£ — 1)}. This gives 


^ A^ = A^ < ia{S) ■ 0^ 

Ip GSt ah {0 ,S (^GStab(S',0,t,£—1) 


where in the last inequality we used the induction hypothesis. 

• R P 0. Pick f G R, and denote R~ = R — {/}. Let f (/) be the set of flaws g G r(/) that 
(i) can occur in the second set of a sequence ip G Stab(i?, 5, t, £), and (ii) do belong to r(/') 
for any /' G R~. Formally, f(/) = {g G r(/) \ g oo f' for all f' G RU 5}. We can write 


E V 

= 

Y 

ip^Stah{R,S,t,£) 


Teind(f (/)) tp£Stab{R- ,SUT,t-l,i) 


(a) 

< 

Ay g{R-)g{SuT)-9^-^ 



Teind(f(/)) 


(b) 

g{R)piS)0^-^ ■ ^ Y 

/ry 

Teind(r(/)) 


(c) 

< 

g{R)piS)0^-^ ■ 0 


where (a) is by the induction hypothesis, (b) is true since g{R ) = g{R)/gf and /r(5UT) = 
and (c) follows from condition (2b). 

□ 
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